Co-referential chains and discourse topic shifts in parallel and comparable corpora

نویسنده

  • Costanza Navarretta
چکیده

In this paper we describe Danish and Italian parallel and comparable texts annotated with (co)referential chains and information about discourse topic shifts, and present an evaluation of the annotation. We also discuss general differences in the way referring expressions are used in Danish and Italian and present the analysis of the relation between the use of types of referring expression and discourse topic shifts in part of the data using the Centering framework.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

استخراج پیکره‌ موازی از اسناد قابل‌مقایسه برای بهبود کیفیت ترجمه در سیستم‌های ترجمه ماشینی

Data used for training statistical machine translation method are usually prepared from three resources: parallel, non-parallel and comparable text corpora. Parallel corpora are an ideal resource for translation but due to lack of these kinds of texts, non-parallel and comparable corpora are used either for parallel text extraction. Most of existing methods for exploiting comparable corpora loo...

متن کامل

Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora

This paper presents the annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese. These guidelines were defined to allow for the empirical study of interactions among intonation and syntax-discourse patterns in selected sets of different corpora (monologues and dialogues, by adults and teenagers). In ...

متن کامل

Veins Theory: A Model of Global Discourse Cohesion and Coherence

In this paper, we propose a generalization of Centering Theory (CT) (Grosz, Joshi, Weinstein (1995)) called Veins Theory (VT), which extends the applicability of centering rules from local to global discourse. A key facet of the theory involves the identification of <> over discourse structure trees such as those defined in RST, which delimit domains of referential accessibility for each...

متن کامل

Hybrid Parallel Sentence Mining from Comparable Corpora

Mining for parallel sentences in comparable corpora is much more difficult than aligning sentences in parallel corpora. Sentence alignment in parallel corpora usually exploits simple empirical evidence (turned into assumptions) such as (i) the length of a sentence is proportional with the length of its translation and (ii) the discourse flow is necessarily the same in both parts of the bi-text ...

متن کامل

Older Adults as Discursively Constructed in Taiwanese Newspapers: A Critical Discourse Analysis

This paper uses critical discourse analysis to examine discursive representations of older people in Taiwanese newspapers. A total of 926 references to older people were sampled from 62 articles published in four Taiwanese newspapers from January to August 2013. The findings suggest that, older people were frequently allocated roles suggestive of dependency. Those portrayed in line with the pos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Procesamiento del Lenguaje Natural

دوره 42  شماره 

صفحات  -

تاریخ انتشار 2009